A new spectral transformation for speaker normalization
نویسندگان
چکیده
This paper proposes a new spectral transformation for speaker normalization. We use the Bilinear Transformation (BLT) to introduce a new frequency warping resulting from a mapping of a prototype Band-Pass (BP) filter into a general BP filter. This new transformation called “Band-Pass Transform” (BPT) offers two degrees of freedom enabling complex warpings of the frequency axis and different from previous works with BLT. A procedure based on the Nelder-Mead algorithm is proposed to estimate the BPT parameters. Our experimental results include a detailed study of the performance of the BPT compared to other VTLN methods for a subset of speakers and results on large test sets. BPT performs better than other VTLN methods and offers a gain of 1.13% absolute on Hub-5 English Eval01 set.
منابع مشابه
Speaker Normalization for Improved Automatic Speech Recognition for Digital Libraries
SPEAKER NORMALIZATION FOR IMPROVED AUTOMATIC SPEECH RECOGNITION FOR DIGITAL LIBRARIES Wei Wang Old Dominion University, 2004 Director: Dr. Stephen A. Zahorian The context of the thesis work is the improvement of automatic speech recognition (ASR) for use with digital libraries. First, commonly used multimedia file formats and codecs are surveyed with the objective of identifying those formats t...
متن کاملLinear discriminant - a new criterion for speaker normalization
In Vocal Tract Length Normalization (VTLN) a linear or nonlinear frequency transformation compensates for different vocal tract lengths. Finding good estimates for the speaker specific warp parameters is a critical issue. Despite good results using the Maximum Likelihood criterion to find parameters for a linear warping, there are concerns using this method. We searched for a new criterion that...
متن کاملUsing Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کاملEfficient Speaker and Noise Normalization for Robust Speech Recognition
In this paper, we describe a computationally efficient approach for combining speaker and noise normalization techniques. In particular, we combine the simple yet effective Histogram Equalization (HEQ) for noise compensation with Vocal-tract length normalization (VTLN) for speaker-normalization. While it is intuitive to remove noise first and then perform VTLN, this is difficult since HEQ perfo...
متن کاملSpectral normalization employing hidden Markov modeling of line spectrum pair frequencies
This paper proposes a spectral normalization approach in which the acoustical qualities of an input speech waveform are mapped onto that of a desired neutral voice. Such a method can be e ective in reducing the impact of speaker variability such as accent, stress, and emotion for speech recognition. In the proposed method, the transformation is performed by modeling the temporal characteristics...
متن کامل